The Holumbus Framework: Creating scalable and highly customizable crawlers and indexers
نویسندگان
چکیده
Full text search is a key technology for today’s information society. The vast accessible amount of information available in public and also privately used media creates the need for efficient search engine technologies. The Holumbus framework helps users with the creation of highly customized search engine applications over structured data sets. This work covers the development and implementation of the crawler and indexer modules of Holumbus. Copyright © 2008 Sebastian M. Schlatt This work is licensed under the Creative Commons Attribution-NonCommercial 2.0 Germany License. To view a copy of this license, visit http://creativecommons. org/licenses/by-nc/2.0/de/ or send a letter to Creative Commons, 543 Howard Street, 5th Floor, San Francisco, California, 94105, USA. Layout done with the help of Timo B. Hübel’s template, LATEX, KOMA-Script and BTEX.
منابع مشابه
Dynamic configuration and collaborative scheduling in supply chains based on scalable multi-agent architecture
Due to diversified and frequently changing demands from customers, technological advances and global competition, manufacturers rely on collaboration with their business partners to share costs, risks and expertise. How to take advantage of advancement of technologies to effectively support operations and create competitive advantage is critical for manufacturers to survive. To respond to these...
متن کاملTopic-Driven Crawlers: Machine Learning Issues
Topic driven crawlers are increasingly seen as a way to address the scalability limitations of universal search engines, by distributing the crawling process across users, queries, or even client computers. The context available to such crawlers can guide the navigation of links with the goal of efficiently locating highly relevant target pages. We developed a framework to fairly evaluate topic...
متن کاملCost Effective and Scalable Synthesis of MnO2 Doped Graphene in a Carbon Fiber/PVA: Superior Nanocomposite for High Performance Flexible Supercapacitors
In the current study, we report new flexible, free standing and high performance electrodes for electrochemical supercapacitors developed througha scalable but simple and efficient approach. Highly porous structures based on carbon fiber and poly (vinyl alcohol) (PVA) were used as a pattern. The electrochemical performances of Carbon fiber/GO-MnO2/CNT supercapacitors were characteriz...
متن کاملبررسی میزان تطابق زبان نمایهسازان، نویسندگان و برچسبگذاران در پایگاه اطلاعاتی اریک و مندلی
Objective: The purpose of this study was to identify the language consistency between indexers, authors and taggers in the ERIC and Mendeley databases. Methodology: This survey was conducted using content analysis methods and techniques to evaluate the language consistency between indexers, authors and taggers in the ERIC and Mendeley databases and also to determine common keywords. The sample ...
متن کاملDesign and Implementation of a Safe, Reflective Middleware Framework
With the rapid evolution of the global information infrastructure and ubiquitous computing environments, service providers will need to provide effective and adaptive resource management mechanisms that can serve concurrent applications in the presence of changing system conditions. Flexible, scalable and customizable middleware can be used as enabling technology for next generation systems tha...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008